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Abstract 

We find simple saturated alcohols with the given number of carbon atoms and 
the minimal normal boiling point. The boiling point is predicted with a weighted 
sum of the generalized first Zagreb index, the second Zagreb index, the Wiener index 
for vertex-weighted graphs, and a simple index caring for the degree of a carbon 
atom being incident to the hydroxyl group. To find extremal alcohol molecules 
we characterize chemical trees of order n, which minimize the sum of the second 
Zagreb index and the generalized first Zagreb index, and also build chemical trees, 
which minimize the Wiener index over all chemical trees with given vertex weights. 


1 Introduction 

Consider a collection of admissible molecules (for example, represented with their 

structnral formulas or chemical graphs), each endowed with k + 1 signihcant physical 

or chemical properties (e.g., normal density, normal boiling point, refraction coefficient, 

retention index, or more exotic and problem-specihc ones), and let Pi{G), i = 0,k, he 

the numeric value of the i-ih property of a molecule G G (e.g., the normal boiling point 
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value). A typical problem of molecular design is the following optimization problem: 


Po{G) —min (max) (1) 

Geo Geo 

Pr" < PiiG) ^ Pr",i = 

When the functions Pi{-) are only partially known from the experiment, they are re¬ 
placed with predicted hgures, relating a chemical graph G G to the predicted value 
Pi{G) of the i-th physical or chemical property (i = 0,fc) by virtue of numeric charac¬ 
teristics (known as molecular descriptors), which can be calculated on basis of a molecu¬ 
lar structure. A typical quantitative structure-property relation (QSPR) includes several 
molecular descriptors, and is presented as 

P,{G) = PiihiG ),..., Im{G)),t = 0,...,k, 

where /i(G),..., ImiG) are the values of molecular descriptors for the molecular graph G. 
The simplest linear regression is just a weighted sum of descriptors: 

Pi{G) = ai^iIi(G) -|-... -t- am,iIm{G), i = Q, 

During recent decades a number of topological, geometrical, and quantum-mechanical 
molecular descriptors were suggested and studied Below we limit ourselves 

to topological descriptors only (see, for instance, the handbook [^) to study problem ([^ 
as a problem of the extremal graph theory . 

Exhaustive enumeration of all feasible molecules (the brute force approach) can only 
be used to solve this problem when the feasible set is relatively small; for bigger sets 
mathematical chemistry suggests a variety of limited search techniques. In numerous 
papers lower and upper bounds of dozens topological indices over various feasible sets were 
obtained [5jff p8p9p^26[|28j - [^ , and, in many cases, extremal graphs were characterized. 
At the same time, the problem ([^ of optimization of a composition of indices is still 
understudied. 

In fact. Ending lower and upper bounds of individual indices can be a step towards 
solving problem ([^, as a linear combination of lower bounds is a lower-bound estimate of 
the combination of indices. This estimate can be used in a branch-and-bound algorithm 
of limited search. Yet, the quality of the estimate may be considerably poor, resulting in 
lack of efficient cuts in a branch-and-bound algorithm. 


Anyway, the common shortcoming of an algorithmic approach to index optimization 
is that it does not support the analysis of general characteristics of an extremal molecule 
(i.e., of a corresponding graph). When available, side information would be of great value 
on why a certain graph is optimal or not, what shape the extremal graphs have, etc. Such 
information is revealed using analytical tools of discrete optimization. 

In this paper we apply recent results in optimization of degree- and distance-based 
topological indices to hnd a simple saturated alcohol with the given molecular weight 
and minimal boiling point. We reduce the property minimization problem to that of 
minimization of a weighted linear combination of the generalized hrst Zagreb index, the 
second Zagreb index, the vertex-weighted Wiener index, and a simple index caring for 
the degree of a carbon atom being incident to the hydroxyl group. Then we characterize 
minimizers of this linear combination of indices (see Fig. and of a couple of simpler 
regressions (see Fig. [^and|^. 


2 Predicting Boiling Points of Simple Alcohols 


The normal boiling point of a liquid is determined by its solvation free energy. The 
solvation free energy can be predicted with high accuracy from computer simulations 
(see [^1^ for details). The simulation-based approach solves well the “direct problem” of 
predicting the solvation free energy for a given molecule, but it does not help solving the 
“inverse problem” of hnding the molecule having the minimal solvation free energy (and, 
consequently, the boiling point). For this reason we predict boiling points of simple sat¬ 
urated alcohols (those having a general formula C„H 2 „+iOH) with the aid of topological 
indices. 

Alcohols have relatively high boiling points when compared to related compounds due 
to hydrogen bonds involving a highly polarized hydroxyl group, and branched isomers 
have lower boiling points than alcohols with the linear structure. Another structural 
feature affecting the boiling point is the “oxygen shielding” effect j^, when atoms sur¬ 
rounding the hydroxyl group partially shield it preventing formation of hydrogen bonds 
between molecules and, thus, decreasing the boiling point. 


We considered several degree-based topological indices (the hrst Zagreb index Mi 12 


the second Zagreb index M 2 12 , Randic index 22 and the others), which are known to 


be good metrics of branchiness, and, hnally, the generalized hrst Zagreb index Ci (see 





also [^) has shown the best results: 


Ci(G)= 5^ c(dc{v)), (2) 

veViG) 

where V{G) is the vertex set of graph G, dciv) is the degree of the vertex v G V{G) 
in graph G, and c{d) is a non-negative function dehned for degrees from 1 to 4. It can 
alternatively be written as 


GiiG) — c(l)ni(G) -|- c(2)?7-2(G) -|- c{?>)n 2 ,{G) c(4)n4(G), (3) 


where UiiG) is the atoms’ count of degree i = 1,...,4 in a molecular graph G, and 
c(l),c(4) are regression parameters. 


We also employed the classical second Zagreb index 12 


M2{G) = ^ dG{u)dG{v), 

uveE{G) 


( 4 ) 


where E{G) is the edge set of graph G. 

Another index used was the Wiener index, which had been the hrst topological index 


for boiling point prediction 27 due to its high correlation with the molecule’s surface 
area. To account for heterogeneity of atoms we allow each pair of vertices u,v ^ V (G) 
to have unique weight ^g{u,v) and calculate the pair-weighted Wiener index as 


PWWI{G):=^ f^G{u,v)dG{u,v), (5) 

u,veV{G) 

where dG{u,v) is the distance (the length of the shortest path) between vertices u and v 
in G. For example, we can assign different weights to distances between pairs of carbon 
atoms and between carbons and the oxygen atom in an alcohol molecule. 

Regression tuning has shown the distances between carbon atoms to be irrelevant for 
the alcohol boiling point, and only distances to the oxygen matter. All such distances are 
accounted with equal weight, so the pair-weighted Wiener index reduces to the distance 
of the oxygen atom, which was hrst used for the alcohol boiling point prediction in [^: 


W/o(G):= dG{u,0). 

u&V{G) 


( 6 ) 


In 21 a geometrical descriptor has also been suggested to account for oxygen shield¬ 


ing, but we extend the approach by 15 instead, and introduce a simple topological index 






Table 1: Data sample: basic statistics 


carbons 

# isomers 

min chain len. 

max chain len. 

min b. p. 

max b. p. 

2 

1 

2 

2 

78 

78 

3 

2 

2 

3 

82.5 

97 

4 

4 

2 

4 

82.4 

117.5 

5 

8 

3 

5 

102 

137 

6 

17 

3 

6 

120 

157 

7 

18 

4 

7 

131.5 

175.5 

8 

10 

4 

8 

147.5 

194 

9 

11 

4 

9 

169.5 

215 

10 

5 

5 

10 

168 

231 

11 

2 

10 

11 

228.5 

243 

12 

1 

12 

12 

259 

259 

TOTAL 

79 

2 

12 

78 

259 


Si{G), which is equal to unity when the carbon atom incident to the hydroxyl group in 
the alcohol molecule G has degree z = 2,3,4 (we exclude methanol from consideration), 
and is equal to zero otherwise. 

We collected a data set of experimental boiling points under normal conditions for 
79 simple saturated alcohols having from 2 to 12 carbon atoms and representing various 
branchiness. Several data sources [8,15,17,21 were combined with priority on Alpha 
Aesar experimental data to resolve discrepancy. In Table [T] we present basic statistics 
about the data sample. 

Information on boiling points of alcohols including more than 12 carbon atoms is 
less common and reliable. The complete data set together with the best regressions is 
available online at (II] . 

We randomly split the sample into the training set containing 50 cases and the testing 
set containing 29 cases. Then we examined different linear regressions involving the 
descriptors mentioned abovej^ The best performance and predictive power was obtained 
for the linear combination of the oxygen’s distance cube root WIo{G)^ (with weight h\), 
the generalized hrst Zagreb index Gi{G) (with weights c°(l),..., c°(4)), the second Zagreb 
index M 2 {G) (weighted by 63 ), and the simple indicator of the sub-root’s degree S 2 {G) 


^ChemAxon Instant JChem® was used for index calculation. Authors would like to thank 
ChemAxon® Ltd (http://www.cheniaxon.com) for the academic license. 





























Table 2: Precision of regressions: Correlation coefficient and standard deviation (SD) 


Data set 

Basic regression (“°”) 

Regression I () 

Regression II (“i^”) 

Corr. 

SD, °C 

Corr. 

SD, °C 

Corr. 

SD, °C 

Training set 

0.997 

2.98 

0.997 

3.12 

0.996 

3.26 

Testing set 

0.996 

3.23 

0.995 

3.58 

0.994 

3.99 


Table 3: Parameters of regressions 


Coefficient 

Basic regression (“O”) 

Regression I () 

Regression II (“^^”) 

bo 

35.245 

50.626 

44.134 

bi 

12.233 

— 

3.851 

b2 

9.170 

11.295 

10.980 

bs 

1.486 

1.000 

— 

c(l) 

— 

— 

— 

c( 2 ) 

9.514 

14.534 

17.727 

c(3) 

9.380 

20.172 

29.673 

c(4) 

— 

17.015 

36.470 


(weighted by b^): 


BP^ = 6 ° + h\WIo{GY + c\2)n2{G) + c°(3)n3(G') + h^S^iG) + hlM^iG). (7) 


Below this regression is referred to as the basic one. The optimal values of weights 
6 ?, c°(l),c°(4), 62 , (including the constant term 6 q) calculated with the least squares 
method are presented in the hrst column of Table Parameters c(l) and c(4), which 
weight variables ni{G) and n 4 {G) respectively, appear to be insignihcant and can be 
set to zero in ([^ when calculating the generalized hrst Zagreb index. The precision of 
the basic regression is shown in Table It is comparable to the best known relations 


8,15,17,21. In the following sections we show how far we can come in analytical and 


numeric minimization of this combination of indices. 

We also considered two simplihcations of regression ([^, for which alcohol molecules 
having minimal predicted boiling point can be characterized analytically. The hrst one 
(below referred to as “Regression I”) is obtained by withdrawing WIo{G) in ([^: 


BP^ = bl + c\2)n2{G) + c^(3)n3(G') + c^(4)n4(G) + blS2iG) + 6^M2(G). (8) 

See Table for the precision hgures of regression ([^ under the values of parameters 
delivering the best approximation to the training set (see the middle column of Table [^. 








































The second simplified regression (referred to as “Regression II” below) is obtained by 
withdrawing M 2 {G) in ([^: 

= 6“ + 6”lV/o(G)^ + ^\2)n2{G) + c"(3)n3(G) + c”(4)n4(G) + 6"52(G). (9) 

In Table we show its precision under the optimal values of parameters depicted in 
the last column of Table El 

The shortcoming of Regression II is that the term WIo{G)^ appears to be insignificant 
after disposal of M 2 {G), being responsible of approximately 1 per cent of the residual sum 
of squares. Nevertheless, we keep this regression for illustration of joint optimization of 
Gi{G) and of the pair-weighted Wiener index. 

3 Minimization of indices and their combinations 

In the present paper we find a simple saturated alcohol isomer with n — 1 carbon atoms 
having the lowest predicted boiling point. As the regressions introduced in the previous 
section are tested only for alcohols containing from 2 to 12 carbon atoms, we restrict our 
attention to n ^ 14, where we can expect some accuracy of the obtained results. 

For n ^ 14 admissible sets of all simple saturated alcohol molecules with n —1 carbons 
are not too extensive, and allow for the brute-force enumeration. Moreover, we are sure 
that no aid of a computer is needed for an organic chemist to draw a molecule being a 
good approximation to the boiling point minimizer for all n ^ 14. However, our aim is to 
show how analytic optimization techniques formalize the professional intuition and help 
making general conclusions of verifiable reliability. 

Let us characterize chemical trees minimizing indices introduced in the previous sec¬ 
tion and their combinations. 

3.1 Degree-based indices 

For a simple connected undirected graph G denote with W (G) the set of pendent vertices 
(those having degree 1) of the graph G, and with M{G) := V (G)\hF(G) the set of internal 
vertices (with degree > 1) of G. 

Definition 1 A simple connected undirected graph of order n is called a chemical tree 
if it has n — 1 edges and its vertex degrees do not exceed 4. Denote with T{n) the set of 
all chemical trees of order n. 


□ 


Definition 2 A pendent-rooted chemieal tree is a chemical tree, in which one pendent 
vertex is distinguished and called a root. A typical pendent-rooted tree is denoted with 
Tr, with r being its root. A vertex being incident to the root in is called a sub-root 
and is denoted as sub(Tr). Denote with 7l{n) the set of all pendent-rooted chemical trees 
of order n. □ 

Clearly, the set f2(n — 1) of all molecules of simple saturated alcohols having n — 1 
carbons coincides with the set TZ{n) of pendent-rooted chemical trees of order n (with 
a root corresponding to the hydroxyl group and the other vertices forming the carbon 
skeleton of a molecule). 

For a topological index /(■) defined on an admissible set of graphs Q introduce the 
notation Ig := min^gg and let Q*j^ := Argminj^^g I{G) be the set of graphs minimiz¬ 
ing /(■) over Q. For example, T^^{n) is the set of chemical trees of order n minimizing 
the second Zagreb index M 2 (-). 

Define also the set IZiiji) := {T G lZ{n) : dT{sub(T)) = i} of all pendent-rooted trees 
with a sub-root having degree i = 2, ..., 4. 

We start with the following obvious statement. 

Lemma 1 Si{G) aehieves its minimum at any pendent-rooted ehemieal tree with sub¬ 
root’s degree other than i. In other words, = TZ{n)\lZi{n). 

Proof is straightforward, as Si{G) = 1 for all G G IZiin), and Si{G) = 0 otherwise. ■ 

Indices Gi{G) and M 2 {G) do not account for heterogeneity of atoms in a molecule, 
so we can minimize them over the set T(n) of all chemical trees of order n and then 
assign the root to an arbitrary pendent vertex of the index-minimizing tree to obtain a 
pendent-rooted tree, which minimizes the index. 

Consider an “ad-hoc” degree-based topological index 

G{G)-.= G,{G) + h^M2{G)= Y, c{dG{v)) + h^ Y doWciv), ( 10 ) 

v£V{G) uv£E{G) 

where 63 is an arbitrary real constant (we keep notation 63 for compatibility with equations 

0 . 0 ). 

Definition 3 A chemical tree T G T{n) is extremely branched, if its internal vertices 
have degree 4, except one vertex having degree 2 when n mod 3 = 0, or one vertex 
having degree 3 when n mod 3 = 1. □ 


Theorem 1 Assume the following inequalities hold: 


( 11 ) 

( 12 ) 

(13) 


c(l) + c(4) + 1863 < c(2) + c(3), 
c(l) + c(3) + 863 < 2 c( 2 ), 
c(2) + c(4) + 863 < 2c(3). 

If a chemical tree T G T(n) for n ^ 3 minimizes C'(-) over all chemical trees from T{n), 
then T is an extremely branched tree. For n ^ 17 the inequality (0 can he weakened to 

c(l) + c(4) + 1763 < c(2) + c(3). (14) 


Proof We employ the standard argument of index monotonicity with respect to certain 
tree transformations. Assume the theorem does not hold, and vertices u, n G M(T) exist 
such that u ^ V and dT{u),dT{v) < 4. Four cases are possible. 

1. driu) = dT{v) = 2. Let ^ 1,^2 G ViT) be the vertices incident to v in T. Also, let 
mi,M 2 £ 14(T) be the vertices incident to u in T, and U 2 lies on the path to the 
vertex v in T. Without loss of generality assume that 

dr(wi) + dTiuf) ^ dTiyi) + dT{v2). (15) 

Consider a graph T' G T(n) obtained from T by replacing the edge uiu with the 
edge uiv. It is easy to see that T' is a tree. The degree of the vertex u in T' is 
decreased by one, the degree of vertex v is increased by one, therefore, if U2 7 ^ n, 
we have 


C{T') — C{T) — c(l) + c(3) — 2c(2) + b^{dT{ui) + dTiyi) + dTiv 2 ) — dT{u 2 )). 


From (15) we obtain C(T') — C{T) ^ c(l) + c(3) — 2c(2) + 2b^dTiui). Since vertex 
degrees ^ 4 in a chemical tree, from ( [I^ we have C(T') — C(T) ^ c(l) + c(3) — 
2c(2) + 863 < 0, which contradicts the assumption that T minimizes C'(-). 

If U2 = V, in the same manner obtain C'(T') — C{T) ^ c(l) + c(3) — 2c(2) + 763 . 


From (12), it is also negative. 


2. dTiu) = 2, dxiv) = 3. Let Vi,02,03 G I7(T) be the vertices incident to o in T. 
Also, let Ml, M 2 £ ViT) be the vertices incident to m in T, and assume M2 lies on the 




path to the vertex v in the tree T. Consider a tree T' G T(n) obtained from T by 
replacing the edge UiU with the edge Uiv. By analogy to the previous case, if U 2 7 ^ n, 
we obtain C(T') — C(T) = c(l) + c(4) — c(2) — c(3) + b^{2dT{ui) + + dT{v 2 ) + 

driy^) — dT{u 2 ))- Vertex degrees ^ 4 in a chemical tree. Moreover, (ir('W 2 ) ^ 2, 
since it is an intermediate vertex on the path m,M 2 , Therefore, 


C(T') - C{T) ^ c(l) + c(4) - c(2) - c(3) + I 863 , 
and, from ( [II| ), C(T') — C{T) < 0 , which is a contradiction. 


To prove the weaker inequality (14) we are enough to prove that CiT') — CiT) < 0 
for n ^ 17, since dT{ui) = drivi) = dT{v 2 ) = dxiy^) = 4, and dT{u 2 ) = 2 is 
possible only in a tree of order 18 or more (an example is depicted in Fig. [^), and 
C(T') — C{T) = c(l) + c(4) — c(2) — c(3) + I 763 for the tree T depicted in Fig. [^. If 



Figure 1: To the proof of inequality (14) 


U 2 = n, without loss of generality assume that u = ^ 3 . Then C'(T') — C{T) = c(l) + 
c(4) — c(2) — c(3) -\-b^{2dT{ui)-yd^iyi)-\-dTiy 2 ') — 2) ^ c(l) +c(4) — c(2) — c(3) + 1463 . 


From (11) (or from (14), if n ^ 17), it is negative. 


3. The case of driv) = 2, driu) = 3 is considered in the same manner. 

4. dTiu) = dTiy) = 3. Let Vi,V 2 ,V 2 , G V{T) be the vertices incident to v in T. Also, 
let Ml, M 2 , U3 G V(T) be the vertices incident to m in T, with mi not laying on a path 
to V in T. Without loss of generality assume that 


driui) + (ir(w 2 ) + dT{u^) ^ dxivi) + dT{v2) + dT{y^). (16) 

Consider a tree T' {n) obtained from T by replacing the edge mim with the edge 


MiM. If MM ^ E{T), then from (16) and driui) ^ 4 we have 


C {T') — C (T) — c(2) + c(4) — 2c(3) + b^^d^iui) + dj^iyi) + dT{y 2 ) + dx{v^) — dT{u 2 ) 







^ c(2) + c(4) — 2c(3) + 2b^d'j'(^Ui) ^ c(2) + c(4) — 2c(3) + 863 , 


which is less than zero due to (13), and T cannot minimize C(-). If un G -E(T), in 


the same way deduce C(T') — C{T) ^ c(2) + c(4) — 2c(3) + 763 , which is negative. 

The obtained contradictions prove that no more than one internal vertex in T may 
have degree less than 4. 

As T G T{n) and n > 1, the well-known equity holds: 


ni(T) -h 2 n 2 (T) ^n^{T) + 4 n 4 (T) = 2{n - 1). 


(17) 


On the other hand, 

ni{T) + n 2 {T) + n^{T) + ni{T) = n. (18) 

Assume that n 2 {T) = 1 , so that n'^{T) = 0. From (|I^ we have ni(T) +n 4 (T) = n — 1, 


therefore, (17) makes n = 3 -|- 3 n 4 (T) and, since n 4 (T) G Nq, n mod 3 = 0. 

In the same manner we show that if n^{T) = 1 then n mod 3 = 1. If both n 2 {T) and 
n^iT) = 0, then n mod 3 = 2, and the proof is complete. ■ 


Corollary 1 Under conditions of Theorem any tree T minimizing C(T) = Ci{T) -|- 
bsM 2 {T) overTiji) enjoys the same number Ui of vertices of degreei = 1, ...,4. Therefore, 
Ci{T) = Ci{T') for any pair of trees T, T' e Tc{n). □ 

Corollary 2 Under conditions of Theorem^the sets T^{n) forn = 4,..., 14 are depicted 
in Fig. T^in) contains the sole tree for n < 14 , while 7^(14) contains two trees. 

Proof From Corollary we learn that only the value of M 2 (-) may vary within TQ{n). 

From Theorem]^ for n G {5, 8,11,14} an optimal tree is a 4-tree (in which all internal 
vertices have degree 4). Each of ni stem edges (those incident to a pendent vertex) adds 
4 to the value of M 2 , while each of 114 — 1 edges connecting internal vertices adds 16 to 
the value of M 2 . Since ni and 714 are fixed for fixed n, all 4-trees have the same value of 
M 2 (-) (and, therefore, the same value of C(-)). Consequently, for for n = 5,8,11,14 the 
set 7})(n) consists of all 4-trees of order n (see Fig. [^. 

If T G Tq^h), and n G {6,9,12}, one internal vertex u G M(T) has degree ^^(w) = 2, 
while all others have degree 4. For n = 6 only one such tree exists depicted in Fig. It 
is easy to check that M 2 (-) is minimized if vertex u is incident to two internal vertices. 
Only one such tree exists for n = 9 (see Fig. |^, and the same is true for n = 12. 




For n G {4, 7,10,13} any tree T G 7}}(n) has one internal vertex u G M{T) of degree 
dT{u) = 3, while all other have degree 4. For n = 4 only one snch tree exists depicted in 
Fig. in and the same is true for n = 7. Again, it is easy to check that, in the context of 
M 2 {-) minimization, vertex u being incident to three internal vertices is strictly preferable 
to vertex u being incident to one pendent and two internal vertices, which is, in turn, 
preferred to u having two incident pendent vertices. So, optimal trees for n = 10,13 are 
depicted in Fig. (black and white filling of circles is explained below). ■ 

The same logic allows continuing the sequence of C'(-)-minimizers to n > 14. 



Figure 2: Chemical trees minimizing the “ad-hoc” index C{-) for n = 4,..., 14 (alcohol 
molecules minimizing with possible oxygen positions filled with black) 


Therefore, Theorem says that, when conditions (0-(|g) hold, chemical trees min¬ 
imizing C'(-) have as many vertices of maximal degree 4 as possible. A similar result can 
be proved for the modified Wiener index WIo{ ). 

3.2 Wiener index 

A simple connected undirected graph G is called vertex-weighted if each vertex v G V(G) 
is endowed with a non-negaive weight Hcip)- With hg we denote the total vertex weight 
of the graph G, and >VT(n) stands for the set of all vertex-weighted trees of order n. 
Klavzar and Gutman defined the Wiener index for vertex-weighted graphs as 

VWWI(G):=^ ^ fiG(,u)fiG{v)dG{u,v). 

u,v£V{G) 


Clearly, VWWI{-) is a special case of the pair-weighted Wiener index PWWI{ ) 
(dehned with formula (|^) for hg{u,v) := IJ^g{u)ij,g{v). The path-weighted Wiener index 
is poorly studied at the moment, but, fortunately, WIo{-), which is the point of our 
current interest, can be reduced to the Wiener index for vertex-weighted graphs. 

For every alcohol molecule from Q{n — 1) (or, equivalently, for every pendent-rooted 
tree G 'Ti{n)) dehne a vertex-weighted tree T{e) G WT(n,) by assigning the weight 
:= e to each vertex v G V(Tr) (a carbon atom) except the root r, and assigning 
the weight ■= 1/e^ to the root (the oxygen atom). It is easy to see that under these 

weights Imi^^QVWWI{T{e)) = WIoiTr). Since WIo{-) is integer-valued, minimizers of 
VWWI{-) and of WIo{-) coincide for sufficiently small £. 

In the majorization technique suggested by Zhang et ah is used to minimize 
VWWI{ ) over the set of trees with given vertex weights and degrees. Below we recall 
the notation and selected theorems from |^. We use them to find the extremal vertex 
degrees over the set of all trees of order n with fixed vertex weights. 

Definition 4 Consider a vertex set V. Let the function fj. : V ^ M+ assign a non¬ 
negative weight fi{v) to each vertex v E V, while the function d : V ^ N assign a natural 
degree d{v). The tuple {fi,d) is called a generating tuple if the following identity holds: 

2(11/1-1). (19) 

vev 

Denote with J1 := l^{v) the total weight of the vertex set V. Let W’T(p) : = 

{T G WT(|C|) : V{T) = V,iit{v) = p^{v) for all n G C} be the set of trees over the 
vertex set V with vertex weights p(-). For the set yVT{p,,d) := {T G WT{p,) : dTiy) = 
d{v) for all n G C} we also require vertices to have degrees d{-). □ 

Let V{fi,d) be the domain of functions of a generating tuple {ti,d). Introduce the 
set W{fi,d) := {w G V{fi,d) : d{w) = 1} of pendent vertices and the set M{p,,d) : = 
V (/i, d)\W (/i, d) of internal vertices. 

Definition 5 We will say that in a generating tuple (/i, d) weights are degree-monotone, 

if for any m,m' G M{p,,d) from d{m) < d{m') it follows that /i(m) ^ /u(m'), and for any 

w E W{p,, d) we have pi{w) >0. □ 

For a generating tuple (p, d) the generalized Huffman algorithm builds a tree 
H E WT (/i, d) as follows. 


Setup. Define the vertex set Vi := V{fi,d) and the fnnctions /i^ and d^, which endow 
its vertices with weights /i^(n) := /i(n) and degrees d^{v) := d{v), n G Vi. We start with 
the empty graph H over the vertex set V (/r, d). 

Steps i = — 1. Denote with rrii the vertex having the least degree among 

the vertices of the least weight in M{fi\d^). Let Wi, ...,Wd(mi)-i be the vertices having 
d{mi) — 1 least weights in W{fi\d’‘). Add to H edges Wirrii, 

Dehne the set Id+i := Vi\{tci, ..■,Wd(mi)-i} and fnnctions endowing its 

elements with weights and degrees as follows: 

;= for V ^ rrii, := n\mi) + /i*(wi) + ... + /i*(wd(mp-i), 

:= d\v) for v ^ rrii, d^^^{mi) := 1. (20) 


Step q. Consider a vertex G M{ijd,d'^). By construction, \M{ijfl,d^)\ = 1, 
\W{ijfl,d^)\ = dirriq). Add to H edges connecting all vertices from to rrig. 

Finally, set Uniy) := /i(n), v G V{H). 


Theorem 2 |10| If weights are degree-monotone in a generating tuple {ia,d), then T G 
yVTviYWiih'^ '^) */ T E WT{ti,d) and T is a Huffman tree. In other words, 

only a Huffman tree minimizes the IFzener index over the set of trees whose vertices have 
given weights and degrees. □ 


In the present subsection we study how the value of DIFhF/(\; 7 -(/i, d) changes with 


degrees d{-). Our results are analogous to those proved by Zhang et ah 29 for the 


■‘classical” Wiener index. Following 10 , we reformulate the problem for directed trees. 


Definition 6 A (weighted) directed tree is a weighted connected directed graph with 
each vertex except the terminal verteJ^ having the sole outbound arc and the terminal 
vertex having no outbound arcs. □ 


An arbitrary tree T G WT(n) can be transformed into a directed tree by choosing 
an internal vertex t G M{T), and replacing all its edges with arcs directed towards (a 
terminal vertex) t. Let us denote with WD the collection of all directed trees, which can 
be obtained in such a way, and let >V'D(p., d) stand for all directed trees obtained from 

^Typically it is called a root, but we will use an alternative notation to avoid confusion with a root 
of a pendent-rooted tree introduced in the previous subsection. 







yVT{fi,d). Vice versa, in a directed tree from yW{fi,d) replacement of all arcs with 
edges makes some tree from yVT{fi, d). 

If at Step i = 1, ■■■,q of the generalized Huffman algorithm we add arcs towards the 
vertex rrii (instead of undirected edges), we obtain a directed Huffman tree with the 
terminal vertex 

Definition 7 For a vertex v G V{T) of a directed tree T G WT> define its subordinate 
group griv) C V{T) as the set of vertices having the directed path to the vertex v in the 
tree T (the vertex v itself belongs to griv)). The weight friy) of a subordinate group 
griy) is dehned as the total vertex weight of the group: fxiy) := 'Yl,u&gT{v) h'Tiy). □ 

Note 1 If all pendent vertices in T have positive weights, then friv) > 0 for any v G 
V{T). In particular, it is true for any T G yVT>{fi,d), if weights in {y,d) are degree- 
monotone. 


The Wiener index is dehned for directed trees by analogy to the case of undirected 
trees: we simply ignore the arcs’ direction when calculating distances. Therefore, a tree 
and a corresponding directed tree share the same value of the Wiener index. 

The value of the Wiener index for a directed tree Tt G WT’(p) with a terminal vertex 
t G M{T) can be written II as: 

VWWI[T,)= Y. /!■.(«)(? -/t.W)= E xUtSv)), (21) 

where x{^) •= x{fl — x), and thus, the problems of Wiener index minimization for vertex- 
weighted trees and for weighted directed trees are equivalent. 


Definition 8 Every directed tree T is associated with the vector of subordinate groups ’ 
weights f(T) := (/'r(u))^gv'(r)\{i}, where t is the terminal vertex of T. From equation 
(21) we see that the vector f(T) completely determines the value of VWWI{T). □ 

Definition 9 |20|29 For the real vector x = [xi, ...,Xp), p E N, denote with = 
(xfi], ...,a;[p]) the vector where all components of x are arranged in ascending order. □ 

I A non-negative vector x = (xi, ...,Xp), p eN, weakly majorizes 


Definition 10 20 

a non-negative vector y = (j/i, ...,yp) (which is denoted with x ^ y) if 


^ all k = l,...,p. 

1=1 2 = 1 


If x-f- ^ y^, then x is said to strictly weakly majorize y (which is denoted with x y y). □ 










We will need the following properties of weak majorization. 


Lemma 2 20 


Consider a positive number b > 0 and two non-negative vectors 


X = (xi, ...,Xk,yi, and y = (xi + b, ...,Xk + b,yi - b, ...,yi - b), such that 0 ^l. 

If Xi ^ yi for i = 1, then y. □ 


Lemma 3 |20,2^ //x ^ y and x' -< y', then (x,x') -< (y,y'), where (x,x') means 


concatenation of vectors x and x'. 


Lemma 4 |20|2^ Uxi^) *■5 ® increasing concave function, and {xi, ...,Xp) ^ {yi, ■■■,yp), 
then Y7 i=i x{.Xi) ^ x{.yi)i eguality is possible only when [xi, ...,Xp)^ 

= {yi-, ■■■■, 2/p)t' □ 

The following lemma establishes an important property of directed Huffman trees: 


Lemma 5 |10i For any directed Huffman tree H 


vm,v'm' e E(H), m 7 ^ < fniv') => fnbn) < /»(>«'). 


( 22 ) 


Lemma 6 Consider generating tuples (/r, d) and (/r, d') defined on the same vertex set, 
and let weights be degree-monotone in {y,d). Let the values of degree functions df) and 
d'f) differ only for vertices u and v, such that d{u) ^ d{v) and p.{u) ^ y{v), while 
d'{u) = d{u) +1, d'{v) = d{v) — 1. Then for every directed tree T G WV^p., d) there exists 
such a directed tree T' G yV'D{p,d') that f(T') >- f(T). 

Proof By Theorem 2 from [^, such a directed Huffman tree H G W'D(p,d) exists, 
that f(-H) t f(r)g Since H G WP(/i, d) and d'{v) = d{v) — l ^ 1, we know that d{v) ^2 
and V has an incoming arc in H from some vertex v' & V. Weights are degree-monotone 
in {p, d), therefore, by construction of a Huffman tree, fniu) ^ fuiv) and, thus, without 
loss of generality we can assume that u ^ guiv). 

Assume that v G gniu). Then a path (n,mi, exists in H from the vertex v 

to the vertex u, where I ^ 0. Consider a directed tree T' obtained from H by deleting 
the arc v'v and adding the arc v'u instead. It is clear that T' G WV{p, d'), and weights 
of groups subordinated to vertices v,mi, decrease by fniv') (which is positive by 

Note[^, while weights of the other vertices do not change. Therefore, by Lemma 

y := = 

^Please note the different notation in flO] {x y is used in [lO] where we write x'^y). 













= ifniv) - fniv'), fnirni) - fniv'),fnimi) - fniv')) >- 
y ifHiv)jH{rni),...jHimi)) =: x. 

If one denotes with z the vector of (nnchanged) weights of groups subordinated to all 
other non-terminal vertices of H, then, by Lemmaf(T') = (y,z) (x, z) = 

Assume now that v ^ gniu). Then there are disjoint paths (m, mi,m) and 
(n, mj^, m) (where k,l ^ Qt) m. H from vertices u and v to some vertex m G M{H). 

If fniu) > fniv), then, applying repeatedly formula (22) from Lemma we write 
= 1,..., min[fc,/]. It also follows from ( [2^ that k ^ I, since otherwise 
/^(mj+i) > fnirn), which is impossible, since m/+i G guijn). 

Consider a directed tree T' G yV'D{g,, d!) obtained from H by deleting the arc v'v and 
adding the arc v'u instead. In the tree T' weights of the groups subordinated to the ver¬ 
tices M,mi, ...,mfc increase by fnW) (i.e., /t'(w) = fuiu) fuiv'), frimi) = fnimi) 
fHiv'),i = l,...,k), weights of the groups subordinated to the vertices v,m[, de¬ 

crease by fniv') (i.e., /t'(m) = fniu) - fniv'), /t'«) = fnimi) - fHiv'),i = I,--,/), 
weights of all other vertices (including m) do not change. Therefore, by Lemma 

y := ifTfu),fT'imi),...,fTimk),fT'iv),fTim[),...,fTimi)) = 

= ifniu) + fniv'), fnimi) + fnW ),..., fnimk) + fnW), 
fniv) - fniv'), fnim[) - fniv ),..., fnim'i) - fniv')) >- 
^ ifniu), fnimi), ..., fnimk), fniv), fnim^), ..., fnim'i)) =: x. 

If z is a vector of (unchanged) weights of groups subordinated to all other non-terminal 
vertices of H, then, by Lemma[^ f(T') = (y, z) >- (x, z) = f(iL). 

By construction of the Huffman tree, the situation of fniu) = fniv) is possible only 
when diu) = div) and p(m) = /r(n). In this case we cannot use formula ( |2^ to compare 
subordinate groups’ weights of elements of both chains, since all possible alternatives of 
/c = 0, or Z = 0, or any sign of the expression fnimi) — fnim'i) in case of Zc, Z ^ 1 are 
possible. 


On the other hand, if fnimi) > fnim'i), then formula (22) can be used to show that 


k ^ I, fnimi) > fnim'i), i = 2 ,..., k. In case of the opposite inequality, fnimi) < fnim'i). 


formula (22) says that, by contrast, k ^ I, fnimi) < fnim'f), i = 2, ...,Z. Repeating this 


argument through the chain, we see that only two alternatives are possible: 





• 0 fnimi) = fnirn'i), i = 1, fnimi) > i=p + l ,..., k. In 

this case, as above, we can show that for the directed tree T' G >V'D(/i, d') obtained 
from H by deleting the arc v'v and adding the arc v'u instead, f(T') f(-ff)- 

• 0 ^p ^ k, fnirui) = /h("i'), i = 1, ...,p, fnirrii) < fnirn'i), i = p + 1, In 

this case the same inequality is true for the directed tree T' G d') obtained 

from H by redirecting to/from vertex v all arcs incident to m, and by redirecting 
to/from vertex u all arcs incident to v except the arc v'v. 

Therefore, we proved that a tree T' G yW{p,d') exists such that f(T') f(hf)- As 
shown above, f{H) ^ f(T), so, dually, f(T') f(T). ■ 

Definition 11 A directed tree T G d) with a terminal vertex t is called a proper 

tree if for all m G M{T), m t, fTi'rn) ^ /i/2. 

Lemma 7 Let a function xix) be concave and increasing for x G [0,/i/2]. Consider a pair 
of generating tuples, (p, d) and (/i, d'), satisfying conditions of Lemma^ IfT G yW{p, d) 
and T' G d') are directed Huffman trees, then 

Xifr'iv)) < xifAv)), 

veviT')\{t'} veviT)\{t} 

where t G M{T) and t' G M{T') are terminal vertices of T and T' respectively. 

Proof From Lemmaj^ such a tree T" G yVV{p, d') exists that f(T") f(T). Theorem 2 

from says that f(T') ^ ^{T"), therefore, f(T') f(T). Denote for short ni = |1/(T)|, 

f(T) = f := (/i,..., f(T') = f := (/{,..., 

It is known (see Lemma 19 in [^) that each directed Huffman tree with degree- 
monotone weights is a proper tree, so, friw) ^ /i/2, w G M{p, and fr'iw') ^ p/2, 

w' G M{p,d')\{t'}. If a vertex w G V{T) exists, such that p{w) > p/2 (there can be at 
most one such vertex in V{T)), then w cannot be an internal vertex in T and a pendent 
vertex in T', since then conditions of Lemmaj^imply that w = v and p{u) ^ p{w), which 
is impossible. Therefore, w is either a terminal vertex both in T and in T', or a pendent 
vertex both in T and in T'. In the latter case friv) = fr'iv) = p{v). 

Consequently, /i,/• ^ p/2 for i = l,...,ni - 2, and if f^-i > p/2, then /„j_i = 

//i_i = p{w). 


If fm-i ^ Ai/2, the statement of the lemma follows from Lemma 
If fn,-l > /i/2, we can write 

xiMv)) - Y xifT'iv)) = 


ni—2 ni—2 ni — 2 ni—2 

= Y - Y “ X(/ni-l) = Y M) - Y M)- 

i = l 2 = 1 2=1 2 = 1 

Since f' >- f and fm-i = we have . 2 ) ^ {fi, and the 

statement of the lemma again follows from Lemma ■ 


Corollary 3 If generating tuples (/i, d) and (/i, d') satisfy eonditions of Lemma then 

vwwry^riL.d') < vwwry^riL.d). 


Proof Theorem 3 from says that WPyiy^y/(/i, d) consists of all directed Huffman 
trees. Consider the directed Huffman trees T G >V'r’y^y^j(/i, d), T' G WV*y^Y-^Yj{iJL,d'). 
Function x{^) = ~ (21) satishes the conditions of Lemma so, from (21), 

VWWI{T') < VWWI{T). Since every proper directed tree from W'D{p,,d) has a corre¬ 
sponding tree from WT (/i, d), and vice versa, we have d) = VWWd), 

and the corollary follows immediately. ■ 


In the rest of the section we consider a set V consisting of n vertices with weights /i(u), 
V E V. The set V can be thought of as a hxed collection of (heterogeneous) atoms used 
as building blocks for molecules. All molecules constructed from these building blocks 
belong to WT(/i). 

We want to use Corollary to show that, similar to Theorem the vertex-weighted 
Wiener index is minimized by a tree having as many vertices of the maximum degree 
as possible. We cannot apply Corollary to the whole collection WT (fi) of trees with 
vertices having hxed weights /i(-) (unless /i(u) = const), as it inevitably contains trees 
generated by the tuples with non-degree-monotone weights, for which Corollary is 
inapplicable. Therefore, we have to carefully limit a set of admissible trees. 

Definition 12 Consider an admissible set Ad C WT(/i) and denote with L : = 
f)yg^lP(r) the set of vertices, which are pendent in all trees from Ad. A generat¬ 
ing tuple (/i, d) is called extremal for Ad, if M(/i, d) consists of vertices having the 

highest weights in V\L (i.e., if u G M(/x, d) and v G V\L, then /x(m) ^ Ai('y)), at most one 




vertex u G M(yU, d) has degree d{u) < 4 while others having degree 4, and, when exists, 
u has the minimal weight in M(/i, d). □ 

There can be several extremal tuples for an admissible set, if for some extremal tuple 
(/i, d) such vertices w G W{fi,d)\L and m G M{fi,d) exist that /i(tc) = p(m) (swapping 
w and m then makes a new extremal tuple). It is clear that weights are degree-monotone 
in {fi, d), and there are only extremely branched trees in >VT(/r, d). 

Theorem 3 If {fi, d) is an extremal tuple for an admissible set A4 C WT(/i) of chem¬ 
ical trees with degree-monotone weights, and H G yVT{fi,d) is a Huffman tree, then 
VWWI{H) ^ VWWIfi, with eguality if and only if M. contains H or any other Huff¬ 
man tree generated by the extremal tuple. 

Proof Consider a vertex-weighted tree T & M.. Weights are degree-monotone in T, so, 
if d'r('w) > dTiy) and d{u) < d{y) for some u,v E M(T), then p,{u) = /i(u). Consequently, 
such a tree T' G Wr(/i) exists that VWWI{T) = VWWIiT'), W{T') = W{T), and 
from u,v E M{T) and dT'{u) < dT'iy) it follows that d{u) ^ d{v) (T' is constructed from 
T with a permutation of several vertices of equal weight, which does not affect the index 
value). So, degrees in the tree T' are “compatible” to those in the tuple (/i, d). 

Assume T' E WT{pi, df) for some generating tuple (p, dP), and dPf) ^ d(-). Construct 
such a sequence (p, dP), (/x, d^),..., (/i, d^) of generating tuples with degree-monotone weig¬ 
hts that k ^ 1, d^(-) = d(-), and each pair (/x, d®), of sequential tuples meets the 

requirements of Lemmax = 0,..., fc — 1. 

We build the elements of this sequence one by one. For a tuple (/x, d®) define 
M+(/x,d®) -.= {w E V dl{w) < d{w)} and M~{pi,dl) := {w E V : dl{w) > d(tc)}. 
Set d®’''^(xx®) = d''{u^) + 1 for a vertex xx® having the highest weight in M~^{fi,d^), and 
d®+i(.y®) = d®(x;®) — 1 for a vertex u® having the least weight in M~{pi,,dl), while keeping 
the degrees of all other vertices. 

The proof is clear from Fig. where numbers on arrows show steps of trans¬ 
formation of the initial degree function. By Corollary]^ VWW d) < ... < 

VWWIf^^{pi,d^) ^ VWWIiT') = VWWIiT). From Theorem we know that if 
T G WT(/x, d), then VWWliT) = VWWIf^^ihid) if and only if T is a Huffman tree. 
Therefore, for every T E M we have VWWI^.j-ihjd) ^ VWWIiT) with equality if 
and only if T is a Huffman tree for (/x, d) or another extremal tuple, and the proof is 
complete. ■ 



Figure 3: Sequence of degree functions converging to d{-). A brace shows L. 


Corollary 4 Trees, where WIo{-) achieves its minimum over the set of pendent-rooted 
trees of order n = 4,14, are depicted in Fig. 


Proof As we already argued, WIo{-) can be seen a special case of VWWI{-) for the 
vertex set where all vertices have sufficiently small weight £ except a root having weight 
1/e. Let r be the root in all considered pendent-rooted trees. Then the set of pendent- 
rooted trees satisfies conditions of Theorem with L = {r}, and the statement of the 
corollary follows from Theorem and the fact that for a given n the value of the index 
cannot be improved only for the extremal degree function d{-), the one distinct from 1 
and 4 at no more than one vertex. The concrete degree function for each n is justified 
from (17), as in Theorem From 10 we know that, for a given degree function d{-), 
Huffman trees (shown in Fig. minimize VWWI{-). ■ 


The extension of the above results to trees with vertex degrees limited to any A ^ 3 
is straightforward. 


4 Minimal Boiling Point 

Below we combine the theorems from the previous section to find an alcohol molecule 
with the minimal (predicted) boiling point. The main tool will be the following obvious 
proposition. 











































































Figure 4: Rooted chemical trees minimizing WIo{-) for n = 4,14 (alcohol molecules 
minimizing with oxygens hlled with black) 


Proposition 1 For an arbitrary admissible set Ai of graphs and a pair of indices I{G) 
and J{G) defined for all graphs G G M., if Ai} H Ai*j ^ 0, then AFjj^ j = Ai] fl Ai*j. 

Proof is straightforward, since Ai*^ fl Ai*j is a collection of graphs where indices /(■) 
and J(-) attain their minima simultaneously. ■ 


We start with the simplest Regression I (see Equation (1^) being a linear combination 


of degree-based indices - the “ad-hoc index” C^(-) defined with Equation (10) and S' 2 ( 
index, which penalizes presence of a sub-root of degree 2 in a pendent-rooted tree. 


Proposition 2 The collection TZ*^pi{n) of pendent-rooted trees {alcohol molecules) with 
the minimal predicted boiling point BP^ for n = 4,..., 14 consists of the trees depicted in 
with black circles representing a possible root {oxygen atom) positions in a tree 
{alcohol molecule). 


Proof Using Table one easily checks conditions (12)-(14) of Theorem to hold for 
the “ad-hoc index” C^(-) and n ^ 14. Therefore, by Corollary trees from P^i{n) are 
depicted in Fig. We know that the “ad-hoc” index does not care for the root position 
in a tree, and, therefore, lZ*^i{n) can be obtained from T^i{n) by assigning the root to 
any pendent vertex of all optimal trees. 





From Lemma *S' 2 (-) is minimized with pendent-rooted trees whose sub-root has 
degree 3 or 4. Each tree in Fig. has a pendent vertex being incident to a vertex of 
degree 3 or 4, so, fl 'n*s^{n) ^ 0, and conditions of Proposition hold for all 

n = 4,14. Therefore, TZ*^pi{n) consists of the trees from Fig. [^with the root assigned 
to a pendent vertex incident to a vertex of degree 3 or 4. Possible roots are hlled with 
black color in Fig. ■ 

The similar reasoning can be carried out for Regression II, which is dehned with 
Equation (|^ and adds up from the generalized hrst Zagreb index the cube root of 

the distance WIo{-)^ of the oxygen atom in an alcohol molecule, and, again, from S 2 {-) 
index. 

Proposition 3 The collection TZ*ppii{n) of rooted trees {alcohol molecules) with the min¬ 
imal predicted boiling point for n = 4,..., 14 consists of the trees depicted in Fig. 
with a black circle representing a root {oxygen atom) position in a tree {alcohol molecule). 


Proof Using Table we check that conditions ([IT|)-([I^ of Theorem hold, therefore, 
from Corollary the generalized hrst Zagreb index is minimized with any ex¬ 

tremely branched tree. 

According to Corollary {n) includes only extremely branched trees depicted 

in Fig. 1^ Since the cube root is a monotone function, by Proposition ~ 

'^wio (^) • 

The sub-root of all trees depicted in Fig. |^has degree 3 or 4. Therefore, 'R-wio{n) fl 
"^ 52 (^) ~ by Proposition!^ TZ*ppii{n) consists of rooted trees from 

Fig-i with roots hlled black. ■ 

The Basic regression (see Equation ([^) combines the “ad-hoc” index C°(-), the cube 
root of the oxygen’s distance WIo{-)p and 5*2(■) index. From Corollary we know that 
WIo{-) is minimized with an extremely branched tree, and so is hF/o(-)^. At the same 
time, neither of the inequalities (|n|)-(|g) from Theoremholds for C'°(-), and we cannot 
be sure that C°(-) is minimized with an extremely branched tree. Therefore, we cannot 
prove formally that BP^{-) achieves its minimum at some extremely branched tree. 

Nevertheless, since for both simplihed regressions {BP^ and BP^^) an extremely 
branched tree appears to be optimal, optimality of an extremely branched tree for the 


Basic regression BP^{-) is believed to be a credible hypothesis, which we state below 
formally. 

Conjecture 1 If BP^{T) = BPff{n) for some pendent-rooted tree T G TZ{n), then T is 
an extremely branched tree. 

Proposition 4 If Conjecture is supposed to hold, the collection IZ*^po{n) of rooted 
trees {alcohol molecules) with the minimal predicted boiling point BP^ for n = 4,14 
consists of the trees depicted in Fig. with a black circle representing a root {oxygen 
atom) position in a tree {alcohol molecule). 

Proof From Conjecture [T| i?P‘^-minimizer is an extremely branched tree. From Fig. 
and|^we see that for n = 4, 5, 6,8 optimal trees for WIo{-) and for C°(-) coincide, while 
for n = 7,11,14 PwiQ{n) C Ilf,o{n). Since all sub-roots in Pwio{^) have degree 3 or 4, 
from Proposition we conclude that Pppo {n) = Tlwio 

For n = 9,10,12,13, if Conjecture[T]is assumed to hold, both IF/o-minimizers and C°- 
minimizers are extremely branched trees depicted in Fig. an ag but Tl\yj^{n)mZ*^o{n) = 
0, and we cannot use Proposition [T] At the same time, the sets of extremely branched 
rooted trees of order n are remarkably small for n = 9,10,12,13, and are completely 
enumerated in Fig. along with their predicted normal boiling points The tree 

with the least boiling point for each n is framed in Fig. Combining the above hndings 
we obtain Fig. ■ 

Predictions of the minimal boiling point based on the Wiener index slightly differ 
from those based on the “ad-hoc” index C'^(-). Minimizers of both indices are extremely 
branched trees. For n = 4, 5,6,8 optimal trees for WIo{-) and for C'^(-) coincide. For 
n = 7,11,14 IPwi^{n) C 77^i(n), so, WIo{-) appears to be a rehnement of and 

Regression II rehnes Regression I. 

For n = 9,10,12,13 minimizers of WIo{ ) and of C^(-) differ. By Corollaries and 
1^ extremely branched trees minimize both indices. They have the only internal vertex 
V of non-maximal degree, but C^(-) suggests settling this vertex in the very center of a 
molecule surrounding it with the other internal vertices, while WIo{-) says vertex v must 
be a stem vertex incident to only one internal vertex. Therefore, predictions of Regression 
II differ from those of Regression I. 



Figure 5: Pendent-rooted chemical trees (alcohol molecules) minimizing BP^{-) for n = 
4,14 with black circles being roots (oxygen atoms) 
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Figure 6: Extremely branched pendent-rooted trees of order n = 9,10,12,13 and their 
BP^ values (minima are framed) 
























Basic regression, which contains a weighted linear combination of the Wiener index 
and the “ad-hoc” index, represents a sort of intermediate behavior between these extremal 
trends (at least for extremely branched trees under Conjecture [^. The weight of the 
Wiener index in the regression is not enough to move the minimal tree sufficiently from 
the trees depicted in Fig. and Basic regression becomes a yet another refinement of 
Regression I. 

5 Conclusion 

The focus of this paper is development of optimization techniques for combinations of 
some well-known and novel topological indices over chemically interesting sets of graphs. 
We derived conditions under which an extremely branched tree minimizes the sum of the 
second Zagreb index and of the generalized first Zagreb index. We also found minimizers 
of the vertex-weighted Wiener index over the set of chemical trees with given vertex 
weights. 

We enumerated index minimizers for moderate (up to 14) non-hydrogen atom count 
in a molecule, and combined them in several regressions of different complexity to forecast 
a simple alcohol molecule with the lowest boiling point. 

For simpler regressions (Regressions I and II) we managed to obtain a complete an¬ 
alytical characterization of extremal alcohol molecules, while for the most complex (yet 
the most precise) “basic” regression we had to limit our attention to extremely branched 
trees (see Conjecture and employed the brute-force enumeration to find molecules of 
low-boiling alcohols. 

Forecasts based on different regressions slightly differ, but they all comply with the 
collected experimental data on normal boiling points of simple alcohols. 

Finally, let us sketch several promising directions of future research. An obvious 
shortcoming of this paper is Conjecture [T| which is explained but is not proven formally. 
To justify it, we need to refine sufficiently our optimization techniques, viz, to optimize 
jointly the Wiener index and the Zagreb indices. 

On the other hand, there is wide space for investigation of popular combinations of 
topological indices forecasting important physical and chemical properties of compounds. 
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